Lexical stress detection for L2 English speech using deep belief networks
Abstract
This paper investigates lexical stress detection for L2 English speech using Deep Belief Networks (DBNs). The inputs to the DBN are syllable-based prosodic features (assumed to follow a Gaussian distribution) and the expected lexical stress of each syllable (assumed to follow a Bernoulli distribution). Because stressed syllables are more prominent than their neighbors, the two preceding and two following syllables are also taken into consideration. Experimental results show that the DBN achieves an accuracy of about 80% in syllable stress classification (primary/secondary/no stress) for words with three or more syllables. It outperforms the conventional Gaussian Mixture Model and our previous Prominence Model by about 8 and 4 percentage points of absolute accuracy, respectively.
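The use of neighboring syllables described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `stack_context`, the per-syllable feature layout, and the zero padding at word edges are all assumptions made for illustration, since the abstract does not say how missing neighbors are handled.

```python
def stack_context(syllables, left=2, right=2):
    """Concatenate each syllable's features with those of its neighbors.

    syllables: list of equal-length feature lists, one per syllable
               (hypothetical layout, e.g. prosodic values plus a 0/1
               expected-stress flag).
    Returns one stacked vector per syllable covering `left` preceding
    and `right` following syllables.
    """
    if not syllables:
        return []
    n_dims = len(syllables[0])
    pad = [0.0] * n_dims  # assumption: absent neighbors are zero-filled
    padded = [pad] * left + [list(s) for s in syllables] + [pad] * right
    width = left + 1 + right
    return [
        [value for frame in padded[i:i + width] for value in frame]
        for i in range(len(syllables))
    ]


# Usage: a three-syllable word with two features per syllable
# yields three vectors of length (2 + 1 + 2) * 2 = 10.
word = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
stacked = stack_context(word)
```

Each stacked vector could then serve as one input sample for a classifier. How the Gaussian and Bernoulli inputs are modeled inside the network is beyond what this sketch covers.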
Related resources
Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks
This paper investigates the use of multi-distribution deep neural networks (MD-DNNs) for automatic lexical stress detection and pitch accent detection, which are useful for suprasegmental mispronunciation detection and diagnosis in second-language (L2) English speech. The features used in this paper cover syllable-based prosodic features (including maximum syllable loudness, syllable nucleus du...
Automatic Classification of Lexical Stress in English and Arabic Languages Using Deep Learning
Prosodic features are important for the intelligibility and proficiency of stress-timed languages such as English and Arabic. Producing the appropriate lexical stress is challenging for second language (L2) learners, in particular, those whose first language (L1) is a syllable-timed language such as Spanish, French, etc. In this paper we introduce a method for automatic classification of lexica...
Integrating acoustic and state-transition models for free phone recognition in L2 English speech using multi-distribution deep neural networks
This paper investigates the use of Multi-Distribution Deep Neural Networks (MD-DNNs) for integrating acoustic and state-transition models in free phone recognition of L2 English speech. In Computer-Aided Pronunciation Training (CAPT) systems, free phone recognition for L2 English speech is the key model of Mispronunciation Detection and Diagnosis (MDD) when learners are allowed to speak freely. A ...
Prosodic Differences between Taiwanese L2 and North American L1 speakers— Under-differentiation of Lexical Stress
Assuming that categorical differentiation is a major acoustic characteristic of English lexical stress, through a binary instead of a more complex 3-way distinction, we investigated lexical stress in broad and narrow focus positions and found how the binary distinction is achieved by the concomitancy of secondary stress defined by its position and distance in relation to primary stress. Similar results a...
English Lexical Stress and Spoken Word Recognition in Korean Learners of English
Two experiments explore how Korean-speaking L2 learners of English process English lexical stress during spoken word recognition. Korean doesn't employ lexical-level prosodic distinctions like English lexical stress, but it has phrase-level prosodic structure ((T) HLH), with the initial tone determined by the phonation type of phrase-initial sound. Results from eye-tracking and gating experimen...
Publication date: 2013